AITopics | ks distance

Collaborating Authors

ks distance

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

febefe1cc5c87748ea02036dbe9e3d67-AuthorFeedback.pdf

Neural Information Processing SystemsFeb-15-2026, 10:08:23 GMT

constraint, corruption distribution, inductive move, (17 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.60)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.52)

Add feedback

3c057cb2b41f22c0e740974d7a428918-Paper.pdf

Neural Information Processing SystemsFeb-8-2026, 07:16:24 GMT

One important limitation of VAEs is the strong prior assumption that latent representations learned by the model followasimpleuni-modal Gaussian distribution. Further,thevariational training procedure poses considerable practical challenges. Recently proposed regularized autoencoders offeradeterministic autoencoding framework, that simplifies the original VAE objective and is significantly easier to train. Since these models only provide weak control over the learned latent distribution, they require an ex-post density estimation step to generate samples comparable to those of VAEs. Inthispaper,wepropose asimple andend-to-end trainable deterministic autoencoding framework, that efficiently shapes the latent space of the model during training and utilizes the capacity of expressive multi-modal latent distributions. The proposed training procedure provides direct evidence if the latent distribution adequately captures complexaspects oftheencoded data.

artificial intelligence, autoencoder, machine learning, (17 more...)

Neural Information Processing Systems

Country: North America > United States (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.92)

Add feedback

The Marked Edge Walk: A Novel MCMC Algorithm for Sampling of Graph Partitions

McWhorter, Atticus, DeFord, Daryl

arXiv.org Artificial IntelligenceOct-28-2025

Novel Markov Chain Monte Carlo (MCMC) methods have enabled the generation of large ensembles of redistricting plans through graph partitioning. However, existing algorithms such as Reversible Recombination (RevReCom) and Metropolized Forest Recombination (MFR) are constrained to sampling from distributions related to spanning trees. We introduce the marked edge walk (MEW), a novel MCMC algorithm for sampling from the space of graph partitions under a tunable distribution. The walk operates on the space of spanning trees with marked edges, allowing for calculable transition probabilities for use in the Metropolis-Hastings algorithm. Empirical results on real-world dual graphs show convergence under target distributions unrelated to spanning trees. For this reason, MEW represents an advancement in flexible ensemble generation. Introduction Recent advances in computational capabilities have greatly increased legislators' abilities to optimize political redistricting plans.

artificial intelligence, machine learning, target distribution, (18 more...)

arXiv.org Artificial Intelligence

2510.17714

Country: North America > United States > New Hampshire (0.16)

Genre: Research Report (0.83)

Industry:

Government > Voting & Elections (1.00)
Government > Regional Government > North America Government > United States Government (0.93)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Add feedback

Using Kolmogorov-Smirnov Distance for Measuring Distribution Shift in Machine Learning

Tonguz, Ozan K., Taschin, Federico

arXiv.org Artificial IntelligenceOct-21-2025

One of the major problems in Machine Learning (ML) and Artificial Intelligence (AI) is the fact that the probability distribution of the test data in the real world could deviate substantially from the probability distribution of the training data set. When this happens, the predictions of an ML system or an AI agent could involve large errors which is very troublesome and undesirable. While this is a well-known hard problem plaguing the AI and ML systems' accuracy and reliability, in certain applications such errors could be critical for safety and reliability of AI and ML systems. One approach to deal with this problem is to monitor and measure the deviation in the probability distribution of the test data in real time and to compensate for this deviation. In this paper, we propose and explore the use of Kolmogorov-Smirnov (KS) Test for measuring the distribution shift and we show how the KS distance can be used to quantify the distribution shift and its impact on an AI agent's performance. Our results suggest that KS distance could be used as a valuable statistical tool for monitoring and measuring the distribution shift. More specifically, it is shown that even a distance of KS=0.02 could lead to about 50\% increase in the travel time at a single intersection using a Reinforcement Learning agent which is quite significant. It is hoped that the use of KS Test and KS distance in AI-based smart transportation could be an important step forward for gauging the performance degradation of an AI agent in real time and this, in turn, could help the AI agent to cope with the distribution shift in a more informed manner.

ks distance, machine learning, reinforcement learning, (15 more...)

arXiv.org Artificial Intelligence

2510.15996

Country: North America > United States > Utah (0.14)

Genre: Research Report > New Finding (1.00)

Industry: Transportation > Ground > Road (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Consistent Estimation of Numerical Distributions under Local Differential Privacy by Wavelet Expansion

Zhao, Puning, Zhang, Zhikun, Sun, Bo, Shen, Li, Zhang, Liang, Wang, Shaowei, Liu, Zhe

arXiv.org Artificial IntelligenceSep-25-2025

Distribution estimation under local differential privacy (LDP) is a fundamental and challenging task. Significant progresses have been made on categorical data. However, due to different evaluation metrics, these methods do not work well when transferred to numerical data. In particular, we need to prevent the probability mass from being misplaced far away. In this paper, we propose a new approach that express the sample distribution using wavelet expansions. The coefficients of wavelet series are estimated under LDP. Our method prioritizes the estimation of low-order coefficients, in order to ensure accurate estimation at macroscopic level. Therefore, the probability mass is prevented from being misplaced too far away from its ground truth. We establish theoretical guarantees for our methods. Experiments show that our wavelet expansion method significantly outperforms existing solutions under Wasserstein and KS distances.

artificial intelligence, estimation, machine learning, (15 more...)

arXiv.org Artificial Intelligence

2509.19661

Country: Asia > China (0.68)

Genre: Research Report (0.82)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.46)

Add feedback

febefe1cc5c87748ea02036dbe9e3d67-AuthorFeedback.pdf

Neural Information Processing SystemsAug-20-2025, 11:26:41 GMT

constraint, corruption distribution, inductive move, (17 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.60)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.52)

Add feedback

Adjusting Regression Models for Conditional Uncertainty Calibration

Gao, Ruijiang, Yin, Mingzhang, McInerney, James, Kallus, Nathan

arXiv.org Machine LearningSep-25-2024

Conformal Prediction methods have finite-sample distribution-free marginal coverage guarantees. However, they generally do not offer conditional coverage guarantees, which can be important for high-stakes decisions. In this paper, we propose a novel algorithm to train a regression function to improve the conditional coverage after applying the split conformal prediction procedure. We establish an upper bound for the miscoverage gap between the conditional coverage and the nominal coverage rate and propose an end-to-end algorithm to control this upper bound. We demonstrate the efficacy of our method empirically on synthetic and real-world datasets.

conditional coverage, conformal prediction, non-conformity score, (12 more...)

arXiv.org Machine Learning

2409.17466

Country:

North America > United States > Florida > Alachua County > Gainesville (0.14)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > Texas > Dallas County > Dallas (0.04)
North America > United States > California > Santa Clara County > Los Gatos (0.04)

Genre: Research Report (0.64)

Industry: Health & Medicine (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.85)

Add feedback

Evaluating natural language processing models with generalization metrics that do not need access to any training or testing data

Yang, Yaoqing, Theisen, Ryan, Hodgkinson, Liam, Gonzalez, Joseph E., Ramchandran, Kannan, Martin, Charles H., Mahoney, Michael W.

arXiv.org Artificial IntelligenceJun-4-2023

Selecting suitable architecture parameters and training hyperparameters is essential for enhancing machine learning (ML) model performance. Several recent empirical studies conduct large-scale correlational analysis on neural networks (NNs) to search for effective \emph{generalization metrics} that can guide this type of model selection. Effective metrics are typically expected to correlate strongly with test performance. In this paper, we expand on prior analyses by examining generalization-metric-based model selection with the following objectives: (i) focusing on natural language processing (NLP) tasks, as prior work primarily concentrates on computer vision (CV) tasks; (ii) considering metrics that directly predict \emph{test error} instead of the \emph{generalization gap}; (iii) exploring metrics that do not need access to data to compute. From these objectives, we are able to provide the first model selection results on large pretrained Transformers from Huggingface using generalization metrics. Our analyses consider (I) hundreds of Transformers trained in different settings, in which we systematically vary the amount of data, the model size and the optimization hyperparameters, (II) a total of 51 pretrained Transformers from eight families of Huggingface NLP models, including GPT2, BERT, etc., and (III) a total of 28 existing and novel generalization metrics. Despite their niche status, we find that metrics derived from the heavy-tail (HT) perspective are particularly useful in NLP tasks, exhibiting stronger correlations than other, more popular metrics. To further examine these metrics, we extend prior formulations relying on power law (PL) spectral distributions to exponential (EXP) and exponentially-truncated power law (E-TPL) families.

artificial intelligence, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2202.02842

Country: North America > United States > California > Alameda County > Berkeley (0.04)

Genre: Research Report > New Finding (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

Add feedback

K-medoids Clustering of Data Sequences with Composite Distributions

Wang, Tiexing, Li, Qunwei, Bucci, Donald J., Liang, Yingbin, Chen, Biao, Varshney, Pramod K.

arXiv.org Machine LearningJul-30-2018

This paper studies clustering of data sequences using the k-medoids algorithm. All the data sequences are assumed to be generated from \emph{unknown} continuous distributions, which form clusters with each cluster containing a composite set of closely located distributions (based on a certain distance metric between distributions). The maximum intra-cluster distance is assumed to be smaller than the minimum inter-cluster distance, and both values are assumed to be known. The goal is to group the data sequences together if their underlying generative distributions (which are unknown) belong to one cluster. Distribution distance metrics based k-medoids algorithms are proposed for known and unknown number of distribution clusters. Upper bounds on the error probability and convergence results in the large sample regime are also provided. It is shown that the error probability decays exponentially fast as the number of samples in each data sequence goes to infinity. The error exponent has a simple form regardless of the distance metric applied when certain conditions are satisfied. In particular, the error exponent is characterized when either the Kolmogrov-Smirnov distance or the maximum mean discrepancy are used as the distance metric. Simulation results are provided to validate the analysis.

artificial intelligence, data mining, machine learning, (19 more...)

arXiv.org Machine Learning

1807.1162

Country:

North America > United States > Ohio > Franklin County > Columbus (0.04)
North America > United States > New York > Onondaga County > Syracuse (0.04)
North America > United States > New Jersey > Mercer County > Princeton (0.04)
(7 more...)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.96)
Information Technology > Data Science > Data Mining (0.94)
Information Technology > Modeling & Simulation (0.67)

Add feedback